# Optical Character Recognition
NVLM D 72B
NVLM 1.0 is a series of cutting-edge multimodal large language models that achieve state-of-the-art results in vision-language tasks, comparable to leading proprietary and open-access models.
Image-to-Text
Transformers English

N
nvidia
14.33k
769
Trocr Large Str
TrOCR is a Transformer-based optical character recognition model designed for single-line text images, fine-tuned on multiple standard datasets.
Text Recognition
Transformers

T
microsoft
571
17
Trocr Small Stage1
TrOCR is a Transformer-based pre-trained optical character recognition model that adopts an encoder-decoder architecture, suitable for OCR tasks on single-line text images.
Image-to-Text
Transformers

T
microsoft
3,713
12
Featured Recommended AI Models